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Abstract. In this paper, we first describe the isothermal constrained 
loop extension DNA amplification (ICLEDA), which is a new variant 
of amplification combining the advantages of rolling circle amplification 
(RCA) and of strand displacement amplification (SDA). Then, we for- 
malize this process in terms of the theory of formal languages and show, 
on the basis of this formulation, how to manage OR and AND gates. We 
then explain how to introduce negation, which allows us to prove that, 
in principle, it is possible to implement the computation of any boolean 
function on DNA strands using ICLEDA. 

1 Introduction 

The first attempt to use DNA for solving computational problems was done by 
Adlcman pQ. Since that time several models of computation using DNA have 
been proposed, we refer to [1] for an overview. Boolean circuits play an im- 
portant role in this research. Their structure allows us to implement them in 
a simple way on DNA support. There are several designs simulating bounded 
fan-in circuits [TU] and semi-unbounded fan-in circuits [3] (in both cases AND 
and OR gates are used). Another approach can be found in [5] where circuits 
with NAND gates are simulated or in JT3] where the construction is based on 
the operation of hybridization of molecular beacons. 

In this article we use an approach similar to |9l2j . More precisely, we use only 
true values and the true value of any gate will be encoded by a specific DNA 
sequence. However, we don't use hybridization to assembly a resulting (answer) 
molecule like in these papers. Instead we use a special type of amplification to 
express the presence of signals (true values for some gates) which can further 
trigger the amplification of new signals following the circuit. Such an approach 
does not require temperature cycles and can be executed autonomously. 



The most common in- vitro DNA strand replication method (also called "DNA 
amplification") is based on PCR. This method is based on a series of primer 
extension cycles with changing temperature conditions to allow for strand sep- 
aration at the beginning of each cycle. 

Rolling-circle amplification (RCA) is another method of strand replication 
based on circular DNA molecules [3] and is inspired from the natural replication 
mechanisms of some viruses [3] . The important observation is that this method 
does not require changing the conditions of the test tube for DNA amplification 
and produces long single stranded DNA molecules including multiple comple- 
mentary copies of the circular template DNA fragment. This procedure was 
used in DNA computing as a basis for the simulation of the resolution refutation 
in0. 

The strand displacement amplification (SDA) is based on the ability of a 
restriction enzyme to nick a modified recognition site and the ability of a poly- 
merase to initiate synthesis at the nick and displace a downstream DNA strand 
during replication [IT] . Both above methods allow us to produce DNA strands in 
isothermal conditions. There are methods using both RCA and SDA, for example 
ramification-extension method (RAM) |12) . 

In this article we consider a new isothermal DNA replication method, called 
ICLEDA for Isothermal Constrained Loop Extension DNA Amplification, de- 
scribed in [S]. It makes it possible to produce short linear and single stranded 
DNA strands in isothermal conditions. Importantly in the perspective of a prac- 
tical application, the amplification is also possible when the template molecules 
are immobilized on a support. We formalize the amplification process in terms 
of formal languages. Such a formal system is constructed from a number of el- 
ements, which we call amplification loop complexes (or simply loop complexes) 
that can be in two states: blocked or unblocked. A loop complex in unblocked 
state produces infinitely the corresponding DNA strand (signal). The transition 
from a blocked to an unblocked state is done by annealing and primer exten- 
sion. As a result we can simulate a signaling cascade whose nodes correspond 
to AND and OR gates. The result is collected in one of the two output nodes, 
corresponding to the true or false value of the corresponding boolean function. 

We also consider a more general framework concerning double-stranded DNA 
molecules that are partially hybridized and that can be dissociated by annealing 
with other single stranded DNA and/or by primer extension. We give a descrip- 
tion of the corresponding objects and operations in terms of the formal language 
theory. This notation gives us a simple way to describe the simulation of logical 
gates and the construction of the circuit. 

2 The mechanism 

In this section, we first describe the ICLEDA amplification process defined in [5] 
on which the whole work is based. Then, we show how this mechanism allows 
us to devise a configuration, which we call the loop complex, which will later 
on allow us to implement logical gates in this context. Note that the word loop 



refers to the shape of the biophysical complex we consider rather than the com- 
putational device which is usually understood by this term, this is why the term 
complex is attached to loop in this denomination. The bio-physical description 
of the amplification process and of the loop complex is the content of Subsec- 
tion EO 

In a second subsection, we propose a formalization of the process described in 
Subsection 12. 1[ see Subsection l2.2l Later, we shall switch to a bit more abstract 
formalism which will be more suited for computation purposes. 

2.1 The amplification and the loop 

The ICLEDA amplification method designed in patent [8] is to some extend a 
combination of RCA and SDA amplification. We refer to this patent for more 
technical information. 

The mechanism is represented on Fig. [IJa). The loop complex is a circular 
molecule composed of two parts: the amplifiable fragment (2) and the loop link 
(1). The arrow represents the 3' end of the amplifiable fragment. We represent 
this molecule schematically as on Fig.QJb). For the sake of commodity we split 
the amplifiable fragment in 3 parts (3,4,5 on the picture) corresponding to the 
3' end, middle and 5' end of the amplifiable fragment. 
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Fig. 1. The loop complex (a): the amplifiable fragment (2) and the loop link (1). The 
schematic representation (b) highlighting the 3 components of the amplifiable fragment 

The amplification mix contains primers (101) which hybridize to the 3' part 
of the amplifiable fragment (3). They can be further extended by DNA poly- 
merase (102) present in the mix, see Fig.[2ta). The loop link (1) length is small 
compared to the length of the DNA fragment: typically 1 to 5 nm. It can be 
a simple chemical link joining the extremities of the DNA fragment, or a bio- 
chemical link between biotin moieties attached to the extremities of the DNA 
fragment via a streptavidin protein. The DNA fragment is also short in regards 
to the stiffness of double stranded DNA. In conditions where the biochemical 
replication reactions can take place, double stranded DNA molecules shorter 



than 300 - 500 nucleotides are too stiff for their extremities to come into close 
proximity. In other words, a circular DNA molecule shorter than 300 nucleotides 
cannot exist in full double stranded form, but is found as stretches of double 
stranded portions separated by single stranded portions. This is true also for the 
loop complexes used in ICLEDA. At some point the complex will be composed 
from a single stranded DNA having n nucleotides from the 5' end of the am- 
plifiablc fragment, a double-stranded DNA corresponding to the 3' part of the 
amplifiablc fragment, the extended primer and the linking loop of special length. 




Fig. 2. The amplification process: primer extension (a); maximum stretch of amplifiable 
fragment and the opening of the 5' end of the double strand (b); a second amplification 
started (c). Notation: link loop (1), amplifiable fragment (3,4,5), DNA polymerase 
(102), (extended) primer (101), single stranded fragment (104). 



Since the two extremities of the amplifiable fragment are linked to each other 
this gives a geometric constraint for the loop. In order to continue the reaction 
either the single strand part should be extended to the maximum or the double 
stranded part should open at the opposite extremity. At some level of tension 
the energetic preference will be to continue the extension of the primer by DNA 
polymerase, while the opposite end will detach by Brownian motion. So, at the 
same time the double stranded fragment will be opened at 5' part and one 
nucleotide will be added by DNA polymerase. However it should be noted that 
the number of nucleotides on the double stranded part remains unchanged, due 
to the geometric constraints of the loop. 

Since no more nucleotides are bound at the 3' end of the amplifiable fragment 
(3) at some point it becomes accessible for a hybridization with a new primer, 
see Fig.[2jb). The extension is blocked when it reaches the end on the amplifiable 
fragment (105) because of the presence of non-natural nucleotides in the link, 
see Fig. (He). 

Fig. |3] shows a loop complex (1) which has three attached primers being in 
different stages of the duplication. The first primer (101a) is paired to the 3' 
part of the amplifiablc fragment (3) and is ready to be extended. The second 
primer (101b) is in the process of the extension and it is paired by its 3' end to 
the central part (4) of the amplifiablc fragment where its extension continues, 



while its 5' part is a single stranded DNA. The third primer (101c) reached the 
end and is not extended anymore, but it is still paired by its 3' end to the 5' end 
(5) of the amplifiable fragment. The progression of the extension is blocked by 
the loop link (1), while its 5' part is progressively detached from the amplifiable 
fragment by the progression of the extension of the second primer. Besides its 
role in the amplification process, the link also enables the possibility of attaching 
the loop complex to a surface without hindering the replication mechanism. 
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Fig. 3. The amplification process: three molecules in different stages of duplication. No- 
tation: link loop (1), amplifiable fragment (3,4,5), DNA polymerase (102), (extended) 
primers (101a,b,c), single stranded fragments (104b, c). 

Now we remark that if in the mix a fragment of a single stranded DNA that 
matches the 3' part of the amplifiable fragment is present, then it can stick to 
the amplifiable fragment as shown on Fig.|4ja). We call such a strand a trigger. 
When a trigger is attached to the loop complex, no amplification can be done. A 
trigger can be detached from the loop complex by an activator that matches by 
its 3' end a part of the trigger strand as shown on Fig. 0Jb). Once bound to the 
trigger the activator can be extended by DNA polymerase and this will release 
the trigger, so the loop complex will be able to start the amplification process. 

2.2 Formalization 

As the main process described in the previous subsection deals with molecules, 
we shall represent them as words over the four-letter alphabet. However, we shall 
not represent individual nucleotides. We shall rather consider the places where 
reactions may occur. Consequently, we shall divide the words in several places, 
the sensitive ones and the neutral ones. If A is a molecule or a part of it, we 
denote by A' its complement in the Watson-Crick complementarity rule. Note 
that A 1 is also written in the opposite order of its letters with respect to A. 
Molecules are oriented and we denote by the symbol o the head (3' end) of the 
molecule, which implies the reading order from left to right for oA and from 
right to left for A'o. 



(a) 



(b) 



Fig. 4. The loop complex blocked by a trigger (a) and the hybridization with further 
extension used to remove the trigger (b). Notation: link loop (1), amplifiable fragment 
(3,4,5), trigger (11), DNA strand used to release the trigger (12) 



We consider that if A and A' are both present and if the configurations of 
the DNA strings to which they belong allow it, they bind each other. Consider 
that A occurs in a molecule M. We write this M = uAv with Horn possibly 
not present: we then say that u or v is empty. Consider that A' occurs in a 
molecule M'. We similarly denote this by M' = xA'y. We assume that A no 
more occurs neither in u nor in v and that, similarly A' no more occurs neither 
in x nor in y. We also assume that A and A 1 do not interact with neither of 
the molecules u, v , x and y and that these molecules also do no interact with 
each other. We express this by saying that u, v, x and y arc neutral parts of 
the molecules to which they belong. This allows us to focus only on A and A' 
which are called the sensitive parts of the molecules M and M'. We shall mark 
this difference between sensitive and neutral parts of molecules in the notation: 
sensitive parts will be denoted by capital letters and neutral parts will be denoted 
by lower case ones. 

When the molecules oM and No are both present, assuming that oM con- 
tains the active part A and that No contains A', we denote this by an additive 
notation: oM No. Now, if we replace oM and No by their expressions in 
terms of A and A' , we get ouAv yA'xo and we now know that as a result we 
obtain a complex as A and A' get bound to each other. We write the complex 
as ouAv (8> yA'xo. Hence the corresponding rule can be written as follows: 

o uAv yA'xo h ouAv ® yA'xo (1) 
As an example, we cannot write 

o uAv yBxo h ouAv €5 yBxo (2) 

unless A = wB't or B = rA's, we remember the reader that lower case letters 
denote neutral parts. To avoid unneeded repetition of rules we shall always use 
the rule in its explicit form ([1]), considering that in ([2]), we have neither A = wB't, 
nor B = rA's. 



We assume that the operation is commutative and associative, which 
corresponds to the fact that models a situation in which the components are 
independent and may freely combine or not and in all possible combinations. 

The loop complex can be formalized as follows: VF'uR' where u is the neutral 
part and F' with R' are the sensitive ones, corresponding to the parts 4,3,5 on 
Fig.[TJb). A trigger can be formalized as owX'Fz, where w and z are the neutral 
parts and X' , F are the sensitive ones. 

The working of the loop complex can be formalized as: 

VF'uR' h Fu'R. (3) 
Now, if there is a trigger, we have: 

VF'uR' otA'Fw h VF'uR' ® otA'Fw. (4) 

where u, t and w are neutral. From subsection 12.11 the result of (jH) blocks the 
application of 

We also have two rules for the trigger which occurs in formula (U): 

VF'uR' <g> otA'Fw zF'Axo h VF'uR' otA'F w ® zF'Axo (5) 

VF'uR' O otA'Fw Aco h VF'uR' otA'Fw ® F'Axo (6) 

In these formulas, as A binds with A' which, as a result, detaches the trigger from 
the loop complex, see Fig.@] And now, we can see that (3) applies. Note that both 
formulas and ^ produce almost the same result as the molecule F'Axo is 
present in the new complex in both cases. The formula © translates the property 
indicated in Sub-section 12.11 to detach the trigger attached to the complex in 
the left-hand side of the formulas, it is enough to present the beginning of the 
active molecules, the process of detachment will produce the continuation of the 
active parts. 

It can be noted that the formalism allows to explain why we obtain this rule. 
Indeed, it might be argued that as F' and F should also bind together, we could 
have the backward reaction: 

VF'uR' otA'Fw ® zF'Axo h VF'uR' ® otA'Fw zF'Axo 

The reason why we have not this reaction is that as A' is more visible for A that 
F is for F' , A attaches to the trigger which, as a result, lead to its freeing from 
the complex. 

In what follows, to simplify the notations, we introduce the following con- 
vention: we represent the triggers Axo and zF'Axo as A. This allows us to melt 
the formulas (|5|) and ([6]) in a single one, namely: 

VF'uR' ® otA'Fw A h VF'uR' otA'Fw ® A (7) 

If we have to explicit the form of A as Axo or zF'Axo, we shall speak of a 
realization of A. 



We can see this higher priority of A in the formalism. Remember that if we 
wish to read the words in the order given by a run other the molecule from its 
head to its tail, we have to read the word from the diamond to the opposite end. 
Define the apartness of a sensitive molecule X with respect to a molecule M as 
the number of sensitive parts of M between the diamond of M and the position 
of X. Denote it by apart(X, M). We can now define the apartness of X with 
respect to M and N, denoted by apart(X, M, N), where X is contained in M and 
X' is contained in N, as the expression apart(X,M) + apart(X' , N). Now, we 
can see that apart(A, VF'uR' <g> otA'Fw, A) = whatever the realization of A: 
as Axo or zF'Axo. Now we have that apart{F, VF'uR' <g> otA'Fw, A) = 1, also 
whatever the realization of A. As the apartness of A is lower than that of F, 
the reaction with A has a higher priority and so it takes place while the reaction 
with F does not. 

3 Implementing boolean functions 

In this section we consider the amplification loop VF'uR'. We assume that if a 
loop VF'uR' is unblocked, then there will be an unbounded number of copies 
of Fu'R. This assumption results from the observation that once started, the 
amplification could produce a large enough number of resulting molecules, even 
if the loop is blocked again afterwards. 

This implies that we can consider that initially all loops are blocked, other- 
wise we substitute them by a large number of DNA molecules corresponding to 
their result. So the computation in such a system consists in unblocking some 
loops in some order. This corresponds in a direct manner to boolean circuits 
where the electrical impulses are propagated in the circuit. The signals we use 
are always identified by the part at the beginning of the molecule, i.e. a signal 
A will be given by the string oAw. 

It is known that any boolean function can be computed by a boolean circuit 
with AND, OR and NOT gates. It is possible to omit the NOT gate if signals 
corresponding to false values of variables are used. We remark that in this case 
only one type of signal (corresponding to true) exists in the circuit and the 
result is obtained by positional information, i.e. there are 2 possible output 
wires corresponding to the true and the false values of the formula. 

From the properties stated in the previous section, we can devise the con- 
struction of the OR- and the AND-g&tes. First, we start with the Oi?-gate 
described in Subsection 13. II In Subsection l3.21 we describe an AND gate which 
is close to the construction of Subsection 13.11 In Subsection 13.31 we describe 
another variant which is closer to that suggested by [8]. 

3.1 The OR-g&te 

Considering rule ((3j) , we decide to interpret the production of the molecule 1Z as 
the emission of the boolean signal true. 



In this condition, if the trigger tA'Fwo is initially attached to a loop complex 
VF'uR', forming complex VF'uR' ®otA'Fw : rule ([7]) tells us that introducing 
the molecule A either as Axo or as zF'Axo, we obtain VF'uR' otA'Fw ® A. 
As is commutative, this corresponds to the fact that now rule applies to 
VF'uR': accordingly, we get again the signal true. 

It is not difficult to obtain a similar sequence of deductions with VG'vR' and 
the trigger orB'Gs. Introducing the molecule £>, either as Byo or as zG'Bxo, 
we shall also get TZ by applying the rules. The final construction for the gate is 
shown on Fig. [5] 




Fig. 5. The simulation of the OR gate. 



It is plain that if we introduce one of A or B we get also TZ: it is enough 
to introduce initially in the soup with both the loop complexes VF'uR' and 
VG'vR' as well as both triggers otA'Fw and orB'Gs, so that we may assume 
that, after a certain time, rule Q applies giving us: 

VF'uR' ® otA'Fw VG'vR' ® orB'Gs. 

^From rule ([7]) and the commutativity and associativity, we obtain that if A 
is poured into the soup, TZ will be produced, as the complex VF'uR' (g> otA'Fw 
will be decomposed into VF'uR' otA'Fw ® A. If B is poured, we get a similar 
result. If both A and B are introduced, we get a similar result as both kinds of 
complexes are present. In all these three cases, the molecule TZ is produced. 

Accordingly, we obtain that a soup which contains both the loop complexes 
VF'uR' ® otA'Fw and VG'vR' ® orB'Gs, it behaves like an Oi?-gate with re- 
spect to the occurrence or absence of the molecules A and B: the required sig- 
nal TZ occurs if at least one of them is present and only in this condition. 

We remark that the construction above can be extended to an n-ary OR 
gate. This gives the possibility to simulate semi-unbounded fan-in circuits. 



3.2 The AND-gate 

We can simulate an AND gate by considering two active regions on the loop, 
i.e. loops of form V 'F' A uF' B vR' , where F' A and F' B are active zones for trig- 
gers having A and B. Then the loop is blocked by two molecules as follows 



S7 F' A uF' B vR' <& otA' F A w ® oqB' F B s, see Fig. g^a). Now if both triggers oAt' 
and oBq' are present, then the loop complex can be unblocked: 

X7F' A uF' B vR' <8> otA'F A w <g> oqB'F B s@ oAtf © oBq' h 

h VF A uF' B vR' ®otA'F A w®oAt' @oqB'F B s®oBq' h 

h VF A uF B vR' ®otA'F A w®oAt'®oqB'F B s®oBq' (8) 

In the above derivation the unblocking can start by the signal At' , but fol- 
lowing the commutativity of © it yields the same result. 

It is clear that if only one of the triggers At' or Bq' is present, then the 
loop complex is only partially unblocked (either V ' F' A uF' B vR' <g> otA'F A w or 
V ' F' A uF' B vR' <g> oqB' F B s) and cannot produce the resulting signal. 

3.3 The initial AND-gate 

Another variant of the AND gate is described in [5] . Like in the previous case it 
is also a complex of 3 molecules, however the loop complex is bound in only one 
place. Its construction is done in two stages: the loop complex VF A wR' is blocked 
by the trigger oF A A'. After that the molecule AB'o is added into the solution 
and it will stick to the A' site. Hence, the complex \7F A wR' <g> oA'F A <E> AB'o 
will be formed, see Fig. IHJb) . We remark that since this complex is formed during 
the preparation stage, we can insure that no molecules oAu, (u ^ B) or otF A Av 
are present in the solution. 

Now during the computation the loop complex can be unblocked as follows: 

VF A wR' <g>oA'F A <g> AB'o@oBu@oAv h 

h VF' A wR' ®oA'F A @AB' o®oA'Bu@oAv h 

h VF' A wR' ®AB' o®oA'Bu@oA'F A ®oF' A Av (9) 

We remark that unlike the previous case this construction is not symmetric, 
i.e. first the signal oBu is removing the molecule AB'o from the complex, freeing 
the site A', which can be bound after that by the signal oAu that finally unblocks 
the loop. 

3.4 Implementing boolean functions 

It is known that every boolean function of n variables can be implemented by 
a boolean circuit using AND, OR and NOT gates. It is possible to eliminate 
the NOT gate by considering that only true signals can circulate in the circuit. 
In this case the input of the circuit is not the true or false value for the same 




Fig. 6. The simulation of the AND gate with two active regions on the loop (a) and 
with one active region on the loop (b) 



variable x, but rather a true value for x or for -ix. The output is also modified: 
instead of a single output having one of the values true or false, there are two 
outputs (marked by true and false) and a true value in some of the outputs 
indicate that the output value of the circuit is true or false. 

For a boolean formula <f> such a modified circuit can be constructed by a 
superposition of two circuits, one computing <f> and the other one computing —\<f>. 

If we consider that the two output nodes are combined into a Active output 
node then such a circuit is a DAG with the root being the output node and the 
leaves being the input variables and their negations. 

We remark an important similarity between traditional electronic implemen- 
tation of boolean circuits and our implementation: if a signal (represented by 
an electric charge in electronics and by DNA molecule in our case) appears at 
some moment during the computation, then it is sufficiently strong and does 
not disappear in the consequent steps. This allows us to make a direct analogy 
between two implementations and use similar construction techniques. This is 
different from other approaches of simulation of circuits by DNA computing, as 
we do not need additional amplification phase anymore. 

More precisely, let / : B n —> B, B = {0, 1} be a boolean function and let D = 
(V, E, F) be the circuit implementing this function (where V = {1, . . . , n} is the 
set of vertices, E C V x V is the set of edges and F : V — > {x p , -*x p , AND m , ORk , 
NOT q , out}, 1 < p, q, 77i, fc < n is the function that labels vertices of the circuit). 
Then for any inner node y such that F{y) = OR m (resp. F(y) = AND m ) we 
construct an OR loop (resp. AND loop) as discussed in 13. II (resp. 13. 2j) . The final 
gate will send the signal to the output node (the root of the circuit (F(x) = out) 
). A similar construction should be performed for -if. The final assembly is the 
union of these two circuits. 

In order to compute the result in the initial configuration signals correspond- 
ing to Xk (where Xk is either Xfc or -iXfc) should be introduced. 

We give below an example of such a construction for the following function: 
/(xi,X2,X3) = (-ixi A X2) V (xi A x 2 A -1X3). 



The ordinary boolean circuit computing / is given below: 



xi NOT 




Next we replace the NOT gate by considering that only true values can 
transit the circuit. This gives the following structure for / (we also numbered 
the gates): 




Now we combine the two constructions. We also add labels to edges going 
out from the same left node (corresponding to a concrete signal produced by the 
corresponding gate) . 




Having in mind that the true value of some node is represented by the pres- 
ence of the corresponding signal (that labels the edge), it becomes clear that 
this construction can be directly implemented using loop complexes and corre- 
sponding signals by 4 AND gates and 2 OR gates. 

The signals Si — Sq correspond to the input values and the signals Sn and 
512 to the output. So the computation starts by giving input signals (taking care 
of not having an input x and ^x at the same time). Then the gates will act in 
cascade and one of two output signals (Su if / is true or S12 if / is false) will 
be obtained. 

4 Conclusions 

In this article we present a new method for the simulation of boolean circuits. 
The use of ICLEDA offers many advantages like a single volume and unchanged 
reaction conditions. This implies that the corresponding implementation will 
not need any additional intervention. Moreover, since the loop complexes can be 
easily attached to the support it is possible to reuse the circuit by washing the 
tube and by introducing trigger molecules to block the loops. Another advantage 
of the method is that the signal molecules (corresponding to the true value of 
some gate) are of a small length; moreover, by introducing compartments it is 
possible to share some of the signals. 

As a further work it remains to experimentally verify the functioning of the 
method. A partial attempt for this is done in [8] where an evidence of the fea- 
sibility of the basic blocs is given. Another interesting question is the further 
investigation of the framework for the partial hybridization of DNA strings in- 
troduced in this article. 
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